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(57) Abstract 

The present invention relates generally to telecommunications, a 
Existing telephony systems suffer from a number of problems including 
on fixed hardware which results in long time to bring new products to m 
personal computers, without the network performing any processing of 
The invention provides for an intelligent server which executes a separai 
available to that user. This allows for an open and flexible teleconferen 
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id more specifically, to a system and method of teleconferencing, 
system complexity, limited access and implementation of services 
arket Internet applications implement teleconferencing on end user 
lata streams or guaranteeing quality of service in the transmission, 
e mixer for each user, where the mixer is dedicated to the resources 
ce system which can operate over multiple networks. 
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Meth d and System < >f Teleconferencing 

Field flnventl n 

The present invention relates generally to telecommunications, and more 
specifically, to a system and method of teleconferencing. 

Background of the Invention 

Teleconferencing systems allow people at different locations to converse as if 
they were in the same room. In spite of the 
these systems, they are commonly used for 
resulting reductions In travel time and cost. 



currently high cost and complexity of 
business applications, because of the 
However, the cost and complexity can 
not generally be rationalized in other applies tions such as academia and private use, 

areas. 

consisted of single microphone and 
physical location participating in the 
broadcast the loudest voice to all other 



Generally though, these new features 
• networks in terms of higher bandwidth 



so teleconferencing is not common in these 
Traditional teleconferencing systems 
monophonic speaker arrangements at each 
teleconference, and the methodology was tc 

participants, blocking the remainder of the voices. However, the art has been 
evolving and systems are now available whi|;h offer such added features as video 
signals of the participants and stereo sound 
present even greater demands on the carrie 

and lower latency, which results in even higher cost and complexity. This largely 
explains the limited availability and use of sich advanced teleconferencing systems. 

Generally, each teleconferencing syejtem is designed to be used with a 
specific communication network. Presently, 
dominant: the public switched telephone network for voice, and the Internet for data, 
These systems are typically composed of teiminal equipment such as telephones or 
personal computers, an access network suc i as a telephony local loop or a radio 
link, and a backbone network such as the ptblic switched telephone network (PSTN) 
or the intercity data networks. Although the leeds of users at the terminals vary 
greatly, the backbone networks must handle 

operate reliably and efficiently. Therefore, traditional communication networks 
focussed on the provision of single services -ather than differentiation. There is no 
incentive for telephone companies to offer varied features or to serve small niche 
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substantlal cost of developing and 



markets as the revenues would not offset the 
implementing these additional products. 

In voice telephony, services are impler nented by having large computer 
programs running on centralized switches whi ;h interrogate local and distributed 
5 databases. The local databases specify which features are enabled on a given line, 
the switch software interprets these feature lis is and Implements the switch 
behaviour, and the switch software also interrogates the distributed databases via 
Common Channel Signalling^System No. 7 (S 37) queries. SS7 is a global standard 
for telecommunications that defines the procedures and protocol by which network 

1 0 elements in the public switched telephone net\ vork (PSTN) intercommunicate control 
messages for basic call setup, management, stnd tear down, as well as for special 
intelligent or database services such as local r umber portability (LNP), toll-free 
(800/888) services and call forwarding. : 

In PSTN, a user only has access to set vices provided by the local exchange 

1 5 carrier, which in turn may only function within t ie bounds of the SS7 protocol. 

Therefore, users can only access the switches in a limited way, and new features can 
not be added by outside parties. 

Telephony features, such as teleconferencing, may only be implemented by 
adding code to the programs running the switc les or by adding specialized hardware 

20 to the telephony network. The features available to particular users are defined in the 
local databases accessed by the switch softws re, and adding a new type of feature 
may involve changing these databases together with the switch software that uses 
them, and may also involve purchasing and ini tailing new types of hardware in the 
network. 

25 This limits the speed with which new fe atures can be introduced since new 

hardware and software must be designed, tested, manufactured and deployed. The 
inflexible assignment of tasks also makes it im possible to share loads between 
different types of hardware, for example to use idle tone-decoding hardware to help 
with an overload of voice-conferencing or to pn vision a new teleconferencing 

30 feature. 

A traditional PSTN teleconferencing system provides each user with a 
bidirectional audio communication link with eac h of a plurality of remote transceivers. 
Typically, the system includes a microphone ai each location for producing an audio , 
signal from that location and a transport network such as the public switched 
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telephone n twork (PSTN) which deliveries each voice signal to a conference bridge 
This conference bridge mixes the voice signals and returns them to audio amplifiers 
and speakers at each location. 

The conference bridge is implemented as a new hardware component 
connected to the switch providing the service. Adding a new feature sue* as Dolby 
noise reduction or bass boosting requires £ physical change to the hardware and/or" 
software in every switch that offers the service. 

Changes to existing telecommunication networks are therefore very 
complicated to make. There is a rigid mod <3l and hardware structure is difficult to 
extend, so existing telephone companies a a forced to focus on broad services. 
When they do develop new products they inevitably take a long time to bring to 
market and are expensive to implement. 

Telecommunications systems need to process the data flowing through in 
complex ways, often with processing occur ing on computer systems separated both 
geographically and administratively. Many communications paths are simultaneously 
active, and the processing applied to the vs rious flows of data changes frequently 
and in a wide variety of ways. The softwares needed to control these computer 
systems is generally large, complex and dlf Icult to change. 

The complexity of present teiecomnr unications systems software, and the 
extensive interactions between its software components, makes the development of 
new features very difficult. As well, telecorr munication services have traditionally 



been provided by large monopolies who employed proprietary equipment that only 
they had access to. Large telephone comp anies hesitate to allow open access to the 
control of their switches and servers due to the risk of failures and the resulting 
damages that would occur; therefore, only v ery limited access is allowed. 



companiesis therefore limited to a 
reduces the talent pool available and 



Software development for telephone < 
"closed" group of trusted developers, which i 
shuts out developers with new ideas for niche markets. 

In summary, problems with the PSTN include: 

1 . system complexity results in long tirr e to bring new products to market; 

2. cost of services results in focus on fow specific services rather than diversity 
and nich markets; 
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existing services are provided by dec icated hardware and software which are 
inflexible and must be physically, anc often manually, modified to offer new 
services or features; and 
4. only proprietary access to switches and their software code is allowed. 

The implementation of software applications in an Internet environment is 
generally done by the software running at thi 5 endpoints, and the IP (Internet 
Protocol) network is treated merely as a con iutt for transfer of data packets between 
the two points. The routers in the IP networt; merely index internal routing tables 
using the address of data packets so that th« ly know how to forward them, and do not 
generate data for either of the endpoints, or i eact to instructions from either of the 
endpoints. The Internet itself may be envisic ned as a series of routers 
interconnected by an Internet backbone network designed for high-speed transport of 
large amounts of data. Users may access the Internet using personal computers in a 
number of manners including modems conneicted to the Public Switched Telephone 
Network (PSTN), or set top boxes connectec to existing telephone or television cable 
networks. 

Communications over the Internet can be administered using various 
protocols, over a variety of physical transfer media. A protocol is a set of conventions 
or rules that govern transfer of data between hardware devices. The simplest 
protocols define only a hardware configuration while more complex protocols define 
data formats, error detection and correction techniques and software structures. 

The key advantages of a protocol like IP are that it allows a large network to 
function efficiently and that it offers a standardized means by which applications 
software can use that network. The main disadvantages are: 

1 . that it does not allow processing to be performed on data streams; and 

2. that it does not allow quality of service i to be specified. 

For example, the Internet generally will not let a user run an applet on a node 
or server. This limitation is due to the architecture of the Internet which is based on 
the international OSI (Open Systems Interconnection) standard. The OSI standard 
describes communication systems using a seven layer model, each layer being 
operable to perform certain functions. Although OSI is not always strictly adhered to 
in terms of keeping related functions togethei in a well-defined layer, most 
telecommunication products make an attemp : to place themselves in relation to the 
OSI model. The OSI standard is not lik ly to change dramatically, nor is th 
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Dt allow quality of service to be specified. 



total end to end transfer rate is still not " 



Internet's use of the standard, so the Internet will not likely become an active 
component in the provision of telecommunication services 

More importantly, the Internet does n 
Internet communications generally rely on th 3 transport of data packets over various 
heterogenous networks, so even though certain links may have predictable data 
rates, for example, a privately owned T1 line 
predictable or dependable. 

Some protocols such as resource reservation protocol (RSVP) set tags and 
priorities which can influence the routers on t\n Internet path a little, but not a great 
deal. The RSVP is an extension to IP that permits specification of quality of service 
at a technical level, in terms of parameters such as data rates and latencies. It has 
had limited acceptance due to the complexity 
need for their switching hardware to be upda 
unless all switches in the end to end connect 
is not generally the case. 

Therefore, typical software applications operating over the Internet, such as 
teleconferencing, look at the Internet as simp y a transport network without any 
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processing capability and all functionality is p 



Implementations of teleconferencing over Internet for example, have software at 



each user's personal computer (PC) that acts 



converting voice to data packets for IP transmission to each of the other participants 
in the teleconference. Accordingly, the user's PC also receives streams of voice data 
from each of the other participants in the tele< inference and plays them through a 
sound card. 

This implementation suffers from severe scalability problems. For example, if 
there are ten participants in a teleconference, then each participant would require 
sufficient bandwidth to download nine simultaieous voice data streams from the 
other participants, in real time. As the bandwdth to each user would increase linearly 
with the number of participants, and the load on the network increase with the square 
of the number of participants, there would be an immense load on the network 
resources. Clearly, this is impractical for teleconferences with a large number of 
parties or services which themselves require high bandwidth such as video or high 
quality voice. Even if the bandwidth could be obtained, there is no way to ensure that 



it adds to backbone networks and the 
;ed. As well, little is accomplished 
on are responsive to the protocol, which 



aced at the participants locations. 



as the interface with the user, 
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ng teleconferencing system is designed 



it is consistently availabl , as there is no way to specify quality of service (QoS) in 
Internet applications. 

As noted above, typically, each exist 
to operate over a particular network and is nrt capable of cooperating with the many 
varied networks now available. These networks include public switched telephone 
network (PSTN), Internet, cellular telephone systems, satellite communications, local 
area networks (LANs) and wide area networks (WANs). Within these networks there 



wireless or hardwired electrical 
over these networks in analogue or 



system which is capable of 
combination of communications 

letworks, for example, use standard 
setting up connections, and have 



are a variety of media including optical fibre, 

connections, which execute communications i 
1 0 digital format using a variety of different prot< >cols. Many of these networks have 

been widely implemented, at considerable a iprtat cost, so it is unlikely that they will 

be quickly abandoned and a new, standard, world-wide telecommunications network 

constructed. Therefore, there is a need for £ 

implementing teleconferencing over a mixed 
15 networks. 

Asynchronous Transfer Mode (ATM) 

protocols for addressing packets of data and ! 

typically been deployed in the core of backbone networks because of the high 
speeds at which ATM equipment operates. Hecause ATM routers are not directly 

20 accessible and because of the complexity of their mechanisms for describing QoS, 
these mechanisms have not been used by applications software. 

Besides the IP and ATM networks me ntioned above, there are other data 
networks such as Frame Relay and Ethernet As well, the PSTN may also be used to 
carry data, for example using trellis coding which maps digital data onto an analogue 

25 signal commonly; which is commonly used by Personal Computer modems. Variants 
are also evolving of each major type of network, and engineering differences 
between implementations of these networks result in different performance. The 
complexity induced by this variety makes it difficult for users and application software 
to exploit all the networks available, and to exploit any to its fullest extent. 

30 Feature development is already diffici It for the simple application of 

teleconferencing over voice networks. As new media such as videophone, typed 
messaging, shared files and whiteboards are mixed with traditional teleconferencing 
products, and new applications such as distance learning, Internet Relay Chat and 
Internet gaming, develop, the problem is becoming even more severe. This probleni 
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will grow even greater as expectations 
mapped into another, as when customers 
apply in videoconferencing or Internet 

Furthermore, even for a single app 
needs, for example, requiring different deg 
there is a need for a system which can 
becoming complex, slow to develop and 

There is therefore a need for a 
may be implemented over mixed 
complexity of such existing networks to 
architecture. 



deyelbp for features from one domain to be 
xpect a feature similar to call-waiting to 



gating. 

ication, different users may have different 
degjrees or forms of encryption; Therefore, 
allow many cases and features without 

slpw in operation, 
metjhod and system of teleconferencing that 
telecommunications networks, and addresses the 
provide an open, scalable and flexible 



input 



Summary of the Invention 

It is therefore an object of the invention 
teleconferencing which addresses the problems 

One aspect of the invention is broadly 
teleconferencing comprising: three or more 
input and an audio output; a telecommunications 
terminals and operable to transport data tc 
modular mixing software for each respective 
telecommunications network, and operable 
the audio outputs of the others of the user 
audio signals into a signal for the audio 
correlates to the needs of the respective ulser 

Another aspect of the invention is 
comprising: means for interconnecting us^r 
from the user terminals; means for executing 
each respective user terminal, the separata 
means for receiving separate audio signals 
user terminals; and means for combining t 
the audio input of the respective user terminal 
respective user terminal. 

An additional aspect of the invention 
teleconferencing comprising the steps of: 
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to provide a method and system of 
described above, at least in part, 
defined as a system for 
user terminals, each having an audio 

network interconnecting the user 
and from the user terminals; separate 
user terminal, executing on the 
to receive separate audio signals from 
terminals; and to combine the separate 
of the respective user terminal which 
terminal. 

Refined as: a server for teleconferencing 
terminals and transporting data to and 
separate modular mixing software for 
modular mixing software including: 
from the audio outputs of the others of the 
ie separate audio signals into a signal for 
which correlates to the needs of the 



is defined as: a method of 
deceiving, at a separate modular mixer 



10 



15 



20 



25 



30 



WO 00/19693 



-8- 



CT/CA99/00875 



user terminals and executing on a 



representing a respective one of three or more 
telecommunications network, separate audio signals from audio outputs of the others 
of the user terminals; and combining the separate audio signals into a signal for an 
audio input of the respective user terminal whitph correlates to the needs of the 
respective user terminal. 

A further aspect of the invention is defined as: a computer data signal 
embodied in a carrier wave, said computer data signal comprising a set of machine 
executable code being executable by a computer to perform the steps of: receiving, 
at a separate modular mixer representing a respective one of three or more user 
terminals and executing on a telecommunications network, separate audio signals 
from audio outputs of the others of the user terminals; and combining the separate 
audio signals into a signal for an audio input of] 1 
correlates to the needs of the respective userl 

A still further aspect of the invention is < 
storage medium storing a set of machine executable code, the set of machine 
executable code being executable by a computer server to perform the steps of: 
receiving, at a separate modular mixer representing a respective one of three or 
more user terminals and executing on a telecommunications network, separate audio 
signals from audio outputs of the others of the i 
separate audio signals into a signal for an aud \ 



the respective user terminal which 
erminal. 

defined as: a computer readable 



user terminals; and combining the 
o input of the respective user terminal 



which correlates to the needs of the respective user terminal 



will become more apparent from the 
to the appended drawings in which: 
system in a broad manner 



Brief Description of the Drawings 

These and other features of the invention 
following description in which reference is made 
Figure 1 presents a physical layout of a teleconferencing 

of the invention; 

Figure 2 presents an exemplary physical layoiit of a teleconference system in a 

preferred embodiment of the invention; 
Figure 3 presents a block diagram of exemplary 

preferred embodiment of the invention 
Figure 4 presents a block diagram of an exemplary 

a preferred embodiment of the invention 



signal processing software in a 
and 

operating system architecture in 
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Detailed D scripti n of Preferred Emb dm fits of the Invention 

A system which addresses the objec;s outlined above, is presented as a 
physical layout in Figure 1. This figure presents a system 10 for teleconferencing 
between three or more user terminals 12, 1i , 16, each having an audio input and an 



5 audio output. The phrase terminal" is used 



suitable manner of user audio input and output device including telephones, cellular * 
telephones, and personal computers with microphones and speakers or headsets. 
The audio input and output refer to the conn actions between the terminal and the 
telecommunications network, and not to the audio interface between the user and the 
10 terminal. 

A telecommunications network 18 intarconnects these user terminals 12, 14 , 
16 and has the necessary functionality to transport data packets between them. The 
telecommunications network 18 also executes separate modular mixing software for 
each respective user terminal 12, 14 , 16. Tiese separate mixers 20, 22, 24 are 

1 5 operable to receive separate audio signals f/ om each of the other user terminals and 
to combine those separate audio signals intc one signal. This one mixed signal is 
transported to the audio input of the respective user terminal, in a manner that 
correlates with the needs of that respective i ser terminal. 

That is, if there were three participant s in the teleconference, there would be 

20 three mixers, a first mixer for participant A, v\ hich mixes the audio output signals of 
participants B and C, a second mixer for par icipant B, which mixes A and C, and a 
third mixer for participant C t which mixes A a nd B. 

The use of Individual mixers 20, 22, 24 executing on the telecommunications 
network 18 addresses a number of the probli *ms noted above. 

25 Firstly, having individual mixers 20, 2: 1, 24 allows each mixer to be dedicated 

to its own user and to be tailored to the limits tions of the user's resources and the 
resources of the network and network conne rtion that services that user. For 
example, the mixer of a user having a high b andwidth connection may provide digital 
quality stereo to its user, with balanced mixirc 

30 user connection via an analogue, monophon < 

strongest voice to its user, blocking voice signals from other participants and thereby 
reducing noise. 

Having a single mixer for all participants, as taught in the PSTN art, would . 
require an immense pi ce of software code i all of the variations in user 



generally in the art to describe any 



g of all participant's voices. Another 
c, PSTN connection may send only the 
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den and i 



other; 



requirements were to be handled in a single 
software code would be unmanageably largs 
have simply not offered such diverse services 
Similarly, the use of mixers 20, 22, 24 < 
5 network 1 8 offers a substantial improvemen 
teleconferencing as well. Typically, Internet 
all participants, so each terminal receives 
participants. This places a tremendous 
connection to each user and a tremendous 

1 0 invention requires only the number of audio 
audio output, to be sent to the user. That is 
only one channel is required, and if stereo is 
sound, surround sound, central bass and 
corresponding numbers of audio channels 

1 5 required to each user and the loading on the 
Thirdly, it is also significant that the 
a modular manner. As will be described in 
software components of the invention are 
small modules designed to handle very 

20 than those like the existing PSTN. Themor^ 
addresses, the easier it is to design that 
of the software system. This is fundamental 
flexible and open. 

Other advantages of the invention wi 

25 description of the preferred embodiment 
example. 

Figure 2 presents an exemplary 
preferred embodiment of the invention, havirjg 
have direct access to an active network, whi 

30 term "active network" refers to a network tha 

software and other related software components 

Participant A has a personal compute r (PC) 
network 28 via a wireless connection 30. Th|e 
serves Participant A is called a NetPort 32. 



specific 



module 
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piece of software. As this piece of 
complex and slow, existing systems 

executing on the telecommunications 
over existing Internet based 
methods broadcast all voice streams to ~ 
to (N - 1) streams where there are N 

on the bandwidth of the final 
oad on the network, in contrast, the 
channels that the user requires at his 
if the user desires monophonic output, 
desired, two channels. Quadraphonic 
audio arrangements would require 
This greatly reduces the bandwidth 
network. 

nhixers of the invention are implemented in 
£ reater detail hereinafter, all of the 
implemented in small modules. Having 
tasks results in a far simpler system 
defined the task that a module 

and later, to integrate it into the rest 
to the provision of a system that is 



become more apparent from the 
whifch will be presented in terms of an 



physical 



layout of a teleconference in the 
four participants. Two participants 
e two are connected to the PSTN. The 
is operable to execute the mixer 
described hereinafter. 
26 connected to a first active 
ntity on the active network 28 which 
he specific role of the NetPort 32 within 
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the active network 28 will be described in greater detail hereinafter. The PC 26 is 
running a stereo-enabled Web browser with a RealAudio plugln that implements 
streaming audio and is output at speaker 34. The PC 26 also has a simple 
microphone plugln that passes samples fro n the on-board microphone 36 back to an 
IP address. Participant A also has a WebCam 38 connected to her PC 26. 

Participant B is connected directly tc i a second active network 40, but in a 
location geographically remote from the firs : active network 28. Participant B also 
listens from a speaker 42 connected to his 3 C 44 through a streaming-audio 
application, but is talking through a wireless telephone 46 for its boom microphone. 
Participant B is plugged into a second NetPort 48 through a telephone jack and 
hardwired connection 50. 

Participant C is connected to the PCTS 52 (plain old telephone system) via a 
plain black rotary-dial telephone 64. 

Participant D has two speaker phonos 54, 56 fed by two separate POTS lines. 
He also has an Internet connection via a PC 60 which runs a Web browser, but his 
Internet Service Provider (ISP) does not provide good enough quality of service 
(QoS) for voice, so he just uses it for the gniphic user interface (GUI). 

A GUI is piece of software that prese nts data to users in a graphical manner, 
allowing for easy interpretation and modification. It is preferred that the invention be 
implemented in such a manner, where poss ble. The GUI runs as Java in the PC 
browser, and communicates with call procei sing applications running on the active 
network by means of sockets. Invoking it in' ralves typing a URL (uniform resource 
locator such as "coolPhones.com"), after which it sits in a window waiting for an 
incoming call or a user input event to place a call. Inputs can be made via a mouse, 
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keyboard, trackball, touchscreen, joystick or 



an Internet network 62, which is 



an interface, though, since it is unacceptable, for example, to have voice-mail fail 
when the PC is not active. Therefore, the rejal call processing decisions are made on 
the active network side. 

This exemplary system also includes i 
connected to the PSTN using, for example, H.323 and SIP (Session Initiation 
Protocol) connections. These connections are known in the art, as are others. The 
Internet network 62 is also shown to be connected to both active networks 28, 40, but 
many other system topographies are also pc ssible. The invention is not limited by 
any particular topography. 



other similar manner. The GUI is strictly 
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While the teleconference is in progre ss, Participant A hears Participant B on 
the left, Participant C in the middle and Part cipant D on the right, because Participant 
A is mixing the other participant's monophonic voice streams into a metaphorical 
stereo spectrum. The use of the stereo output has two major advantages. Firstly, it 
aids in identifying which participant is currently speaking. Secondly, it allows higher 
noise levels to be tolerable to participants due to the "cocktail party" effect This 
effect acknowledges that people are able to 
an environment where there is considerable 
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converse comfortably with one another In 
background noise, provided that they 



have a means for identifying and focussing their attention on a particular speaker. 



The use of stereo sound has been shown to 
Stereo sound can be synthesized in 
implementation, amplifier gain can be varied 



provide this identification. 
; a number of manners. In a simple 
between the left and right channels, for 



example: one participant may be played at full volume on the left channel and none at 



all on the right, a second may be played with 



1 5 none on the left, while a participant may be played at equal volume in both channels 
More complex implementation of stereo may, for example, introduce a delay to the 
audio signal before playing it on one of the c lannels, simulating the additional time 
the sound takes to travel to the farther of the listener's two ears in a regular physical 
environment Such methods are generally kiown to those skilled in the art. 

Accordingly, Participant A's GUI scre3n shows a Web page with Participant 
B's picture on the left, Participant C's number in the middle and Participant D's caller- 
ID name on the right. In each case, the GUI < 

information that it has available for each part cipant. This identification may include a 
URL or telephone number. As well, "screen- pop" information could be provided 
which Identifies personal information about the participant such as his address. 

As well, via the GUI, Participant A can click on "ear" and "eye" icons for each 
of the three participants on her screen to ensble or disable their ability to hear her or 
to see her through the WebCam. Participant i 



A can also drag on a "mouth* icon to set 
the volume level at which they speak to her, knd drag participants left and right to 
adjust their stereo imaging. Alternatively, the volume level of participants could be 
governed by their physical location on the GUI display - the further a participant's icon 
is away from an icon repres nting the user, t ie lower their volume level. Another 
preferred feature of the GUI is that when someone speaks, th ir icon becomes * 
brighter then gradually fades again with inact vity . 



full volume on the right channel and 
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Similarly, Participant B hears Partict 
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pants A, D and C in a metaphorical stereo 



spectrum from left to right. Because he finds the sound of participant A's voice close 
to that of Participant C, he has chosen to separate them spatially as much as 
possible. This is done with the same type of GUI described with respect to 
Participant A above. 

Participant C hears a conventional f|>ur-way conference call, with the voices ~ 
of the three other Participants companded jind mixed together. As a result, she has 
difficulty distinguishing Participants B and D. However, she has the flexibility to tailor 
the call to some extent with the preference 1 or single voice dominating, adding noise 
filters, or other functionality via her proxy. This addition of functionality directly to 
PSTN customers is very significant. As explained in the background, PSTN services 
are driven by a supply model that only provides commodity services, and takes a 
long time to provide those limited services. There is a vast PSTN infrastructure which 
provides single monophonic lines into millions of homes and businesses, all of which 
are shackled with these limitations. The uses of proxies in the manner of the invention 
provides greater flexibility and access to new services which may be implemented 
quickly and at very low cost. More details a -e provided hereinafter regarding the 
preferred use of proxies. 

Participants who do not have the capability of interacting with the active 
network will have generic proxies assigned to them which are dictated by the nature 
of their telecommunication connections. For example, the system knows that a 
Participant only has PSTN access if that is t ie connection the call manager has 
identified as the best connection during call setup. 

Participant D has a similar stereo arrangement, over which he has defined 
Participant A to reside on the left speaker te ephone, Participant D on the right 
speaker telephone and Participant C on both channels. This arrangement also 
creates a metaphorical stereo spectrum. Ot ler means are known in the art for 
carrying stereo over the PSTN, but such me hods generally require more complicated 
hardware at the Participant's end. Participant D has the same GUI as Participants A 
and B so he is able to control his proxy and mixer on the active network directly. 

As an example, exemplary signal processing software for participant A is 
presented as a block diagram in Figure 3. Voice streams from other participants 
arrive in different forms and need to be convBrted to a consistent form, companded 
and then mixed. They also need to be trans Tiitted across th radio link in an efficient 
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(PC VI) 



the 



rot i 



form, and other signal processing such as echo 
to be done on the voice data. In this case, RealAudio 
consistent form though MP3 or a number of 
particularly convenient as it is a realtime streaming 
5 industry and for which many tools are currently 
This type of processing is also required for the 
different forms for each participant which corre 

Not shown in Figure 3 are such functions 
level control, though their implementation follows 

10 provided herein. 

Specifically, pulse code modulated 
Participants C and D, which are connected to 
is the standard transmission form for audio in 
received from Participant B is already in RealAudio 

1 5 packets and is easily transported over IP, it is 
passing its signal to the RealAudio mixer 68. 

The RealAudio mixer 68 combines the i 
with the participant's requirements, In Figure 
line is shown which interfaces the RealAudio mjxer 

20 card 72. The audio output of the RealAudio mifcer 
well, and also to a PCM converter 74. 

This PCM converter 74 feeds the echo 
more or less matches the output from the participant' 
echo cancellor 76 can remove the speaker output 

25 up by the microphone 36. The PCM signal 
converted to RealAudio at the voice coder 78. 

RealAudio packets are numbered sequentially 
arranged in the proper order when they are deqoded. 
to time stamp packets as the time delays are 

30 packets that result from their transport from different 
will not generally be detectable by the participants 
deliberately add a delay to the incoming signals 
signal jitter. As the data is arriving in finite and 
inevitably be some degree of jitter, so buffering 
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cancellation/suppression may need 
has been chosen as the 
forms qpuld be used. RealAudio is 
standard that is well known in the 
available, such as codecs and mixers, 
other listeners, and it takes slightly 
ate to their respective setups, 
as encryption, tone controls and 
logically from the description 



voice streams are received from 
RealAudio converters 64 and 66. PCM 
PSTN. Since the voice signal 
io format, which comprises data 
necessary to convert it before 



I leaving 



ipcoming audio signals in accordance 
, a bi-directional Activity and Control 
68 with the PC 26 via an Ethernet 
68 goes to the Ethernet card 72 as 

^ancellor 76 with an audio signal that 
:'s speaker 34. This way, the 
signal that is inadvertently picked 
the echo cancellor 76 is 



to ensure that they are 
Generally, it is not necessary 
; and the varied delays in data 
sources, or by different routes, 
In fact, RealAudio may 
by storing them in a buffer to absorb 
distinct data packets, there will 
is preferred. A buffer that causes 20 
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mSec - 50 mSec delay is sufficient time to a bsorb the effects of jitter most of the time, 
and is not long enough to annoy the users t ) a great xtent. 

Also, note that all audio signals passing between the NetPort and NetPort 
Manager are In RealAudio form, and not PCM. This makes the transport over the 
digital interconnection more convenient. 

Other preferred aspects of the teleconference are outlined as follows: 
Call Setup 

The most important aspect of call setup is the identification of the participants, 
where they can be found and then ol course, creating the connections. In the 
preferred embodiment, the teleconfe 'ence will be created by one or more of 
the participants who are GUI-enabled These participants will advise the 
network of the identities of the particiDants and the call setup software on the 
network will make the connection wit i the participants. 
Some of the participants will have Internet addresses, while others will have 
telephone numbers. In each case, tt e call setup software will investigate the 
participant and establish the best possible connection that it is aware of. 
Those participants without Internet access will be assigned proxies which 
reflect the resources they have access to. For example, if the call setup 
software identifies a PSTN telephone number as the best connection, it will 
assign a PSTN proxy to that participant unless advised otherwise. 
In the preferred embodiment, all part cipants who are GUI-enabled can add 
participants to the call, but for high-socurity teleconferences, addition of 
participants should be controlled by z single participant 
Telecommunications Operating System 

The telecommunications operating system aspect of the invention provides 
unified control and access to all syst€ m resources and networking links, with 
the functionality in and implied by Fifl ure 3. This represents a large collection 
of signal processing and control functions connected together in response to 
the commands of the callers. 
This contrasts with the "pure Internet 
cooperating tasks in all of the various 



makes it very difficult to optimize and 



Protocol" approach which require 
computers to arrange to do their parts 
of the processing through an applicat on-specific protocol built on socket() 
calls, with no single program having Jin overview of the whole setup. This • 



manage the system, and each such 
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application has to reinvent call process 
part of the underlying implementation 
invention is built as a middleware layer 
In "telephony classic 0 one would not 
5 generality in software, but would make 

hardware that assumes all inputs and 
numbers to call in order to connect to it 
makes this a 'closed system 0 , in which 
limitations on who can develop new 

10 3. Proxy 

It is preferred that the invention be 
graph model as described in the 
Patent Cooperation Treaty, Serial No 
for Telecommunications". While a si 

1 5 and manage the entire connection 

well as voice streams, that application i 
"agents" from "proxies". 
A proxy is a piece of software that acts 
connection. In this case Participants A 

20 and Internet providers are separate 

contains data that represents the 
whether Participant A is already on the 
network's 28 voice trunk is getting full, 
do specific tasks, such as responding to 

25 managing a voice call in progress 

The terms proxy and agent are 
For the purposes of this document, they 
agents, each of which handles a specia 
not comprise an immense block of code 

30 in its simplest form, is merely a 

as required, discarding them when their 
These agents are sent to parts of the 
going on and are connected to the signal 
through' a "controlling application". This 



ng. The invention uses socket () as 
a preferred embodiment, so that the 
on top of IP. 
attempt to set up something with this 
special "stereo conferencing server* 
cjutputs are PCM and would add speciaT 
The need for specialized hardware 
innovation is slowed down by 
telecommunications applications 

appl ed to a network which employs a 
co-perjding patent application under the 
_, titled "Connection Manager 



sometimes 



m 
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single application program is used to see 
including IP and videoCam connections as 
constructed by collecting together 

on behalf of a specific party to a 
through D. and each of the networks 
with separate proxies. A proxy 
preferences and state of the party, such as 
elephone and whether the first active 
has components that are agents to 
off-hook on a telephone and 



paries 



e nd I 



used interchangeably in the art. 
are distinct: a proxy is built out of 
situation. Therefore, the proxy does 
with all conceivable functionality, but 
which instantiates software agents 
tasks have been completed, 
system in which signal processing is 
processing code or hardware 
architecture of proxies, agents and 
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controlling applications is what allows connection management applications to 
appreciate the whole structure of a connection while still being "owned" by 
several different parties. 

Proxies should persist in the preser ce pf component failures, so that, for 
example, a user's forwarding instructions do not get lost during a crash. It is 
preferred that persistence be provic ed via a distributed database which is 
continuously updated, so that all concerned parties are aware of the status of 
the communication. In the event of a failure, the system is able to work 
around the failure, allowing the corr munication to continue. Such 
transactional interaction techniques are known in the art. 
In the "pure Internet Protocol" approach there is only custom software running 
on hardware belonging to the various parties involved and communicating 
through socket() mechanisms in an ad hoc protocol. The invention builds an 
additional structure on top of this. 

In "telephony classic* there is a sinc le very large program that looks at a 
database for all users and decides what they would want to do. This program 
is too large to modify quickly, and can only be modified by the equipment 
manufacturer. Again, this approach is not flexible enough for rapid evolution 
of new features. The architecture of the invention makes it easier to 
understand and modify software, without the same complexity, allowing the 
system to be open to software development, so that new features may be 
brought to market very quickly. 
Graph 

It is preferred that the invention be applied to a network which employs a 
graph model as described in the co- pending patent application under the 
Patent Cooperation Treaty, Serial No. titled "Method and System for 

Configuring Communications Systems*. Briefly, the graph model constructs 
the signal processing and communi :ations structure as a mathematical graph, 
which is later implemented by takinjj "filters" that implement the nodes out of 
libraries and modifying them, either by a dynamic linking process or by setting 
the IP addresses to which they make socket connections, to have the 
interconnection structure specified by the edges. This graph is also used for 
communication among the agents, as the data structur that defines a 
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be 



connection. An API layer that describes 
added above the raw graph structure 
In addition to filters, it is also preferred 
calls to proxy agents required to set up 
agents to collaborate on building and 
An application programming interface 
simple and high level functions into the 
execute those functions, simplifying 
Windows APIs, for example, a program 
message boxes, as weir as perform m0re 
single instructions. 

The particulars of how an API for the 
but it is desirable that a standard API 
connection and negotiation processes, 
standard API simplifies the creation of 
A GUI is particularly well suited to the 
present the assembled filters as defined 
a very logical and understandable form, 
the functionality to let the user modify 
filters and their interconnections. 
In the "pure Internet Protocol" approach 
is not visible at all, while in the "telephqny 
switch software to connect physical 
of libraries. The decision about what 
made by the users by dialling telephon 
approach. 

Real Time Operating System (RTOS] 
Voice teleconferencing is a real time 
known in the art. Generally, RTOS's 
units of threads and functions, and therji 
threads and functions to be performed 
Distributed Operating System 
A distributed operating system is one in 
on different nodes. In the case of a 



characteristics of these graphs is 
assist in writing agents, 
that this graph data packet contain 
the call. Proxies may also send their 
nhanaging graphs. 

(API) converts a series of comparatively 
lower level instructions necessary to 
i of an operating system. Using 
can open windows, files, and 
complicated tasks, by executing 
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invention is Implemented are not critical, 
employed that expresses control, 
including payment. The use of a 
lew features by third parties. 
i|se of a graph model, as the GUI may 
by a graph data packet, to the user in 
It is also preferred that the GUI have 
graph data packet simply by altering 



tiei 



I poits 



ports 



the overall communications structure 
classic 0 approach H is possible for 
together, but not to pull functions out 

to connect together is explicitly 
numbers in the "telephony classic" 



pncedure, so RTOS should be used as 
divide code to be executed into smaller 
schedule the execution of these 
Drior to specific deadlines. 



which portions of the software can run 
telecommunications syst m, distribution 
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of software makes It easier to maint ain real time operation as there are more 
options available to schedule timely 
improves scalability and speed. Tho 

the efficient use of a distributed system, in that agents and proxies may be 
assigned to run on different nodes of the system. Ideally, agents will be 
located close to where they are required, to minimize time delays in 



communicating with the entities the) 



RTOS is described in the co-pendinjj patent application under the Patent 
Cooperation Treaty, Serial No. 



User processes running as 'filters", 
come from being in the signal path, 
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execution. As well, distributed operation 
use of agents and proxies lends itself to 



represent. Such a suitable distributed 



..titled "Distributed Telco" 



Figure 4 presents a block diagram of an exemplary operating system 
architecture in a preferred embodimsnt of the invention. A distributed 
communications substrate 80 is inte posed between user processes and the 
underlying machines, so that processes can generally be moved from one 
machine to another without being av/are of it, either to distribute load or to 
recover from failures. 
Processes running in the system co ne from different sources and accordingly 
get different treatment in terms of the trade-off between security and 
performance. Call-processing functions acting on behalf of the end users run 
in a protected "sandbox" environment on a virtual machine. Those working 
on behalf of the network provider may run there, but may also be 
implemented directly as processes running on the network operating system. 



with the hard real-time demands that 
and also run directly on the 



communications substrate 80. Processes belonging to different users are 
protected from each other by the us jal operating system mechanisms such 
as memory mapping and file privileges, but the source is also reviewed by the 
network administrator. Filter processes on the same machine and part of the 
same call may share an address space and a thread of control, with data 
being passed with a function call mechanism and with connections to other 
hardware being handled by a stub that adapts a function call to a socket-type 
mechanism. These filters would still be dynamically linked, even with the 
function-call mechanism. 

Signals pass through filter processes F, which also implement drivers and 
performance-sensitive functions on behalf of the network. Call processing on 
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allocated to a physical network: each 
nodes, for example, and sometimes 



each of N participants, each mixer w 



mapper must balance the benefit of c listributing the mixers among various 



processors against the extra cost of 
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behalf of users is handled by CP pre cesses running in a secure virtual 
machine VM environment, which also includes checkpointing functions that 
can transfer control on failure to a M £ host machine" 82. All these processes 
run on the common software communications layer 80, which places them on 
appropriate physical systems and arranges for their connections. Server 
processes S also run on the communications substrate 80, but do not have 
the hard real-time constraints of the filter processes. Secure call-processing 
functions are one type of server process 
Mapper 

The allocation of tasks and network fcapacity to different communications 
graphs is done by an optimizer called the mapper. There are in general 
many ways that a graph representing a desired communication can be 



of the filters can run on several different 
there are several types of links over 
which data can be carried. The simplest embodiment uses hints from the 
proxies about where to put radio links, that is, after the voice coder, and then 
applies a "greedy algorithm" to put computing resources as close to the net- 
work edge as possible. A good map aer should be a distributed application in 

oca! knowledge to the greatest extent 
possible. It is not essential to get a global optimum, as long as resources are 
not seriously wasted. 

In the case of the invention, the most significant resource management 
problem is the handling of the voice streams. In having a separate mixer for 



receive (N-1) voice streams. The 



ransporting redundant audio signals. 



The factor that governs these decision in the implementation of the invention 
is quality of service (QoS). Methods of distributing such real time loads are 
known in the art, but in the case of the invention, the solution will vary with 
each set of participants and network :opography. 
Negotiation 

It is preferred that the architecture fo| agents provide for use of the negotiation 
system described in the co-pending patent application under the Patent 
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Cooperation Treaty, Serial No.. 

Negotiating Telecommunication Resources 



Many users are competing at once 
(including its computing capacity) 



frequent. With differential service s 
defined, but the definition needs to 
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titled "Method and System for 



i : or the shared resources of the network 
t is preferred to apply a market model to 
resolve this contention: agents for t le parties involved offer and demand 
payment as part of connection setu d, and a connection does hot happen until ~ 
all parties have accepted it. A caller can choose to try a connection at a 
reduced quality level if the cost of tl ie high quality connection becomes too 
high. For example, on Christmas Day, the load may be temporarily high, so 
users can expect to get through wit i reduced voice quality rather than getting 
a busy signal. 

In "pure Internet Protocol", temporary congestion is resolved on a "best 
efforts" basis and packets may be almost arbitrarily thrown out, and at a 
longer time-scale by overprovisioniijig the network so that failures are not too 

small number of priority classes are 
i be managed. The market model of the 
invention can be used to manage dfferential service, allocating high priority 
access in such a way as to permit guarantees on service. 
In "telephony classic" contention is managed by call admission (first-come, 
first served) and again the network is overprovisioned so that failures are not 
too frequent 

Negotiation management may be implemented by having a negotiation agent 
for each of the user terminals and f Dr each of the multiple telecommunications 
networks. Each negotiation agents is operable to execute somewhere on the 
telecommunications system, for example, on the active network, and 
represents the interests of its respective party in negotiating communication 
over the telecommunications network. This is done by identify participants in 
the negotiation and then passing a graph data packet which describe the 
proposed connection, to each participant for their consideration. Each 
negotiation agent may either accept, reject or revise it to make a new 
proposal to the other negotiation acients. 

When all or part of the graph data packet is to be executed, a device simply 
assembles the listed filters in the manner defined in the graph data packet. 
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implem ht & with a strong security 



invention include: 



It is also preferred that the invention be i 
mechanism that protects proxies from erroneous or malicious code in other proxies. 
As well, it is presently desirable that proxies ar d agents be written in Java™, but 
another language with similar advantages could also be used. Advantages of Java™ 
5 include: 

a. excellent security 

b. a large community of experienced developers 

c. object oriented code structure 

d. simple net-based distribution mechanism 
10 A telecommunications system impleme ited with the functionality described 

above provides a foundation for the mixed media applications of the future, and for 
greater flexibility and power to existing service^ such as high bandwidth telephone, 
and Internet gaming. 

Other options for implementation of the i 
15 1. Companding 

Companding techniques use "compression" algorithms that try to adjust gains 
(smoothly) so as to keep a signal's level more constant and "expansion" 
algorithms that adjust gains to exaggerate signal-level variations. Some 
techniques used in audio are frequency - 
20 companding which adjusts filter cutoffs 

signal levels are low. 

An extreme example of expansion is "squelch" in which signals with power 
level below a certain threshold are turned off completely to minimize idling 
noise. In telephony the most common variant is "echo suppression", as 

25 opposed to "cancellation", in which the signal path from the quieter user has 

its gain reduced, which reduces the loo o gain for echoing and feedback 
oscillation. Companders use around 5-50 operations per sample. 
Instantaneous companders work on a sample-by-sample basis, and the 
common A-law case is covered under "coders" below. 

30 2. Voice coding 

Voice coders are used to reduce the bandwidth requirements for voice 
signals. There are many types, but broadly they can act on the waveform, 
minimizing some math matical measure like error power; they can model the 
source; or they can model what the ear will notice. Coding for compression is 



-dependent, such as Dolby 

to suppress background hiss when 
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an active research area, and a steajdy stream of new coders is likely to . 
appear. 



'Telephony classic" uses waveform 
law). Sampling is done at 8kHz on 



coding in the form of 8kHz A-law (or p- 
a signal filtered to pass the range from 



300Hz to 3300Hz. The passband was defined to get good subjective scores 



on speech quality and intelligibility, 



stereo 16-bit fixed-point sampled at 
the bandwidth and the use of a T1 
sample to sample, the same quality 
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and the sample rate is designed with a 



33% margin over the Nyquist minin urn in a trade-off between network and 
prefilter costs. A-law and p-Iaw are specialized 8-bit floating-point 
representations, chosen as a way to get roughly constant signal-to-noise over 
a wide range of signal levels. By comparison, compact disc (CD) sound is 

44.1kHz, which requires roughly 24 times 
ine. Because speech varies slowly from 
can be had for roughly half the bandwidth 



with ADPCM (adaptive differential pulse-code modulation) which, roughly 
speaking, digitizes the derivative in itead. 

Most digital cell-phones use a variant of linear prediction coding, which tries to 



model the incoming sound in terms 
cords or airflow and which in turn d 



requires less bandwidth than waveibrm coding because the larynx moves 



more slowly than the waveform, bu 



listeners know the connection is sti 
Source coding is difficult to use for 



of a sound source that simulates the vocal 
ives a filter that models the larynx. This 



works badly for anything other than 



speech or even for speech in a noisy environment. These "source coders" 
are an active topic of research and currently produce tolerable speech at 
output rates anywhere from 4kb/s up. A typical modern coder uses about 
SOMIPs of DSP capacity. Coders typically operate on 20msec frames of data, 
and hence add at least that much c elay to the signal path. 
Source coders typically try to detect silence, and avoiding the transmission of 
silence typically saves about 50% cif bandwidth on average. At the decoding 
side it is conventional to replace silpnce with "comfort noise" so that the 

live. / 

music, because it would be necessary to 



model a large number of different instruments alone and in combination, so 
early digital audio such as CD and DAT, just used waveform coding with 
enough bandwidth and dynamic range to satisfy (more or less) the human 
ear. Minidisc and digital compact cassettes brought in coding that reduced 
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CD bandwidth by a factor of about 10 b/ using psychoacoustics. 
Psychoacoustics applies, in particular, masking effects, where loud tones 
mask nearby ones for normal ears, and 
transmitting the inaudible components 
rate-adapted, as in RealAudio, and is a 



speech applications in the system of tho invention, 



Conventional filters, companders and s 



coded signal, so it is standard to decompress before filtering. In some cases 
this may be avoided, for example, N-WEy combining can take advantage of 
silence to do companding at no additional cost of bandwidth, and only needs 
to decode and recode during bi-directioial conversation. 
MPEG (Motion Picture Experts Group) 

MPEG coders do the same type of thinp for video signals that perceptual 
coders do for music. Components of a 

frequencies are digitized at low resolution, using 8*8 discrete cosine 
transforms to do the filtering, and using "motion estimation" so that 
components of an image that can be deirived from adjacent frames are not 
retransmitted. MPEG decoding is preferably left for the end-user's PC, 
because it is very demanding and because specialized hardware exists for it. 
However, the traffic properties are an inportant consideration in implementing 
the invention. Straight digitized television requires roughly 30 frames / sec * 



200 kpixels / frame * 3 colours * 8 bits / 
what 3G wireless is built to handle, but 
hence the 3G requirement for that rate, 



capacity when the image changes sudc enly. 
At the low-quality end, videoconferencing is usually done at 128kb/s. At this 
rate the coding process adds hundreds of msec of delay and the picture is 
poor. 

If there is high demand for full-motion video, then 5MHz slots will not have 
sufficient capacity, but 20MHz slots an(jl generous use of antenna diversity 
could support 10-40 users at that rate. 
The network operating system could initiate process s in the end-user's PC 
so that video s rvices can be set up eajsily. 
Oth r applications 



bandwidth can be saved by not 
This type of technique can also be 
good candidate for high-quality 



milar components will not work on a 



colour, for 144Mb/s. That is beyond 
MPEG2 gives similar quality at 2Mb/s; 
MPEG2 is also bursty, needing more 
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the 



Other applications such as animatejd 
locations, voice activation, automat 
signal shaping to compensate for 
or software in the system, are all Mown 
5 invention. 

Examples have been shown to 
but the number of variations is by no mearjs 
could be made for any telephony device, ii 
machines, pagers, point of sale computers, 
10 private branch exchanges. While particular 
have been shown and described, it is cleat 
made to such embodiments without depan 
invention. 

The invention could, also be implemented 

1 5 Internet and PSTN networks. For example 
the functionality of the invention similar to 
PSTN a specialized server could be attached 
implementations would not have all the behefits 
certain aspects of its teachings. 

20 The method steps of the invention 

machine code stored in a variety of formatjs 
Such code is described generically herein 
program for simplification. Clearly, the 
with the code of other programs, implemented 

25 calls or by other techniques as known in 
The embodiments of the invention 
or similar device programmed in the manner 
an electronic system which is provided wit 
Similarly, an electronic memory means su 

30 Access Memory (RAM), Read Only Memojry 
storage media known in the art, may be 
As well, electronic signals representing 
via a communication network. 



i the; 
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video, stereo input at the participant's 
c gain control (AGC) at the user's PC, and 
frequency response of certain devices 
in the art, and easily applied to the 



demonstrate various aspects of the invention, " 
complete. Comparable implementations 
including personal digital assistants, fax 
amateur radios, local area networks or 
embodiments of the present invention 
that changes and modifications may be 
ing from the true scope and spirit of the 

to a lesser extent on existing 
, Internet servers could be given much of 
; applications such as NetMeeting. On the 
to a class 5 switch. These 
of the invention, but could apply 

may be embodied in sets of executable 
such as object code or source code, 
as programming code, or a computer 
executable machine code may be integrated 
as subroutines, by external program 
art. 

may be executed by a computer processor 
of method steps, or may be executed by 
i means for executing these steps, 
h computer diskettes, CD-Roms, Random 
(ROM) or similar computer software 
programmed to execute such method steps, 
method steps may also be transmitted 



these i 
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It would also be clear to one skilled in the art that this invention need not be 
limited to the described scope of computers and computer systems. The principles of 
the invention could be applied to citizen's band radio, amateur radio, or packet radio. 
Again, such implementations would be clear t<j> one skilled in the art, and do not take 
away from the invention. 
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WHAT IS CLAIMED IS: 

1 . A system for teleconferencing comprising 

three or more user terminals, each having an audio input and an audio output; 
a telecommunications network interconnecting said user terminals and operable to 

transport data to and from said use ■ terminals; 
separate modular mixing software for each respective user terminal,, executing on 

said telecommunications network, emd operable: 

to receive separate audio signals frpm said audio outputs of the others of said 

user terminals; and 
to combine said separate audio signals into a signal for said audio input of 

said respective user termins I which correlates to the needs of said 



aid 



respective user terminal. 

2. A system as claimed in claim 1 , further 
modular connection management software 
said three or more user terminals 
including a connection proxy for 
said telecommunications network; 
each of said connection proxies executing 
to represent its owner's interests in 
recognizing the limitations o 



eai;h 



cind 

Dn said system and being operable: 
managing the teleconference by 
its resources. 



3. A system as claimed in claim 2, further 
a mapper for locating said separate modular 
terminal for execution on different 
network, trading off delay time in 
computational power available in 



A system as claimed in claim 3, wherein 
comprises multiple telecommunications 
and protocols, each of said multiple 
own connection proxy. 



comprising: 
for establishing interconnections between 
said separate modular mixing software, 
of said three or more user terminals and 



comprising: 
mixing software for each respective user 
rbuters of said telecommunications 
communicating data between routers with 
to maintain quality of service. 



order 1 



said telecommunications network 
networks with varied transport media 
telecommunications networks having its 



WO 00/19693 



28 



5. A system as claimed in claim 4, further 
negotiation management software including a 

terminals and said multiple telecommunications 
negotiation agents being operable: 
to execute on said system; and 
to representing the interests of each of 
negotiating communication over 
said negotiation management software being 

to identify negotiation agents participating 
to implement a negotiation discipline which 
negotiation agent to consider a 
accept or revise said communication 
to respond to said negotiation being successful 
communication contract. 



6. A system as claimed in claim 5, wherein 
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comprising: - 

rjegotiation agent for each of said user 
networks, each of said 



said three or more user terminals in " 
said telecommunications network; 
oberable: 

in a negotiation; 
allows each said participating 
Communication contract and either 
contract; and 

by executing said 



said separate modular mixing 



software is operable to combine said se parate signals into two or more audio 
channels which define a metaphorical physical space, each user terminal 
having a simulated position within said metaphorical physical space whereby 
individual users may be recognized by their particular position in said space. 



A system as claimed in claim 6, wherein 
from said respective user to emphasize 
the corresponding audio signal prior to 



combining. 



A system as claimed in claim 7, whereiifi 
comprises a personal computer having 
speakers, and said respective mixer software 
separate signals into two audio channe 



said mixer is responsive to a request 
a particular user's voice by amplifying 



at least one of said user terminals 
a stereo sound card and stereophonic 
is operable to combine said 

s. ' 
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9. A system as claimed in claim 8, wherein at least one*of said user terminals 



comprises a connection to a PSTN 
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network via two monophonic telephone 



lines, and said respective mixer sqf ware is operable to combine said 
separate signals into two audio channels. 



10 



opera sle 



A system as claimed in claim 9, whprein each said connection proxy 
comprises: 
multiple software agents each being 
a proxy object operable to instantiate particular 

in response to requirements of communications 
telecommunications system. 



mixing 



11. A server for teleconferencing comprising 
means for interconnecting user terminals 
terminals; 

means for executing separate modular mining 
terminal, said separate modular 
means for receiving separate audic 
others of said user terminal: j 
means for combining said separate 
input of said respective use 
said respective user terminal 



s nd transporting data to and from said user 



12. A method of teleconferencing comprising 
receiving, at a separate modular mixer representing 
more user terminals and executing 
separate audio signals from audio 
and 

combining said separate audio signals ink 

respective user terminal which correlates 
terminal. 



to perform a specific task; and 
ones of said multiple software agents 
made over said 



software for each respective user 
software including: 
signals from said audio outputs of the 
; and 

audio signals into a signal for said audio 
terminal which correlates to the needs of 



the steps of: 

a respective one of three or 
on a telecommunications network, 
Outputs of the others of said user terminals; 

a signal for an audio input of said 

to the needs of said respective user 
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13. A computer data signal embodied in a 
comprising a 

to perform the steps of claim 12. 
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farrier wave, said computer data signal 

a computer 



set of machine executable code being executable by 



14. 



A computer readable storage medium utoring a set of machine executable 
code, said set of machine executable code being executable by a computer 
server to perform the steps of claim 12. 
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FIGURE 3 
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